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Abstract 

Global transposable characteristics in the complete DNA sequence 
of the Saccharomyces cevevisiae yeast is determined by using the met- 
ric representation and recurrence plot methods. In the form of the 
correlation distance of nucleotide strings, 16 chromosome sequences 
of the yeast, which are divided into 5 groups, display 4 kinds of the 
fundamental transposable characteristics: a short period increasing, a 
long quasi-period increasing, a long major value and hardly relevant. 
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1 Introduction 



The recent complete DNA sequences of many organisms are available to 
systematically search of genome structure. For the large amount of DNA se- 
quences, developing methods for extracting meaningful information is a ma- 
jor challenge for bioinformatics. To understand the one-dimensional symbolic 
sequences composed of the four letters 'A', 'C, 'G' and 'T' (or 'U'), some sta- 
tistical and geometrical methods were developed [H El EUlElEllZllHlElIinilll]. 
In special, chaos game representation (CGR)[12], which generates a two- 
dimensional square from a one-dimensional sequence, provides a technique 
to visualize the composition of DNA sequences. The characteristics of CGR 
images was described as genomic signature, and classification of species in 
the whole bacteria genome was analyzed by making an Euclidean metric be- 
tween two CGR images[13]. Based on the genomic signature, the distance 
between two DNA sequences depending on the length of nucleotide strings 
was presented [14] and the horizontal transfers in prokaryotes and eukaryotes 
were detected and charaterized[T5l [16]. Recently, a one-to-one metric repre- 
sentation of the DNA sequences |17j. which was borrowed from the symbolic 
dynamics, makes an ordering of subsequences in a plane. Suppression of 
certain nucleotide strings in the DNA sequences leads to a self-similarity of 
pattern seen in the metric representation of DNA sequences. Self-similarity 
limits of genomic signatures were determined as an optimal string length for 
generating the genomic signatures [18]. Moreover, by using the metric repre- 
sentation method, the recurrence plot technique of DNA sequences was estab- 
lished and employed to analyze correlation structure of nucleotide strings [T9]. 

As a eukaryotic organism, yeast is one of the premier industrial microor- 
ganisms, because of its essential role in brewing, baking, and fuel alcohol pro- 
duction. In addition, yeast has proven to be an excellent model organism for 
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the study of a variety of biological problems involving the fields of genetics, 
molecular biology, cell biology and other disciplines within the biomedical 
and life sciences. In April 1996, the complete DNA sequence of the yeast 
(Saccharomyces cevevisiae) genome, consisting of 16 chromosomes with 12 
million basepairs, had been released to provide a resource of genome informa- 
tion of a single organism. However, only 43.3% of all 6000 predicted genes in 
the Saccharomyces cerevisiae yeast were functionally characterized when the 
complete sequence of the yeast genome became available [20J. Moreover, it 
was found that DNA transposable elements have ability to move from place to 
place and make many copies within the genome via the transposition |2 1 [ [22] . 
Therefore, the yeast complete DNA sequence remain a topic to be studied 
respect to its genome architecture structure in the whole sequence. 

In this paper, using the metric representation and recurrence plot meth- 
ods, we analyze global transposable characteristics in the yeast complete 
DNA sequence, i.e., 16 chromosome sequences. 

2 Metric representation and recurrence plot 
methods 

For a given DNA sequence siS2 ■ ■ ■ Si ■ ■ ■ sn {si E {A, C, G, T}), a plane metric 
representation is generated by making the correspondence of symbol Sj to 
number /ij or z/j G {0, 1} and calculating values (a, (3) of all subsequences 
Sfc = S1S2 ■ ■ ■ Sk < k < N) defined as follows 



where fii is if Sj G {A, C} or 1 if Sj G {G, T} and z/, is if Sj G {A, T} or 1 
if Si G {G,G}. Thus, the one-dimensional symbolic sequence is partitioned 



a = 2E-=i/ifc-i+i3-^+3-'^ 
f3 = 2j:U^k-j+iS-^ + 



2E•=lyU^3-('^-^+l)+3-^ 
2Eil^^^3-(^-^+l) + 3-^ 



(1) 
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into A^" subsequences E^. and mapped in the two-dimensional plane {a,p). 
Subsequences with the same ending /-nucleotide string, which are labeled 
by E', correspond to points in the zone encoded by the /-nucleotide string. 
Taking a subsequence Ej e E' , we calculate 

e{ei - |E, - E,|) = Q{ei - ^(a, - a,)^ + (A - (2) 

where is the Heaviside function [Q{x) = 1, if a; > 0; 0(x) = 0, if x < 0] 
and Ej is a subsequence {j > I). When 6(e; — |Ej — Ej|) — 1, i.e., Ej e E', 
a point is plotted in a plane. Thus, repeating the above process from 
the beginning of one-dimensional symbolic sequence and shifting forward, we 
obtain a recurrence plot of the DNA sequence. 

For presenting correlation structure in the recurrence plot plane, a corre- 
lation intensity is defined at a given correlation distance d 

N-d 

E{d)^ ^e(Q-|E,-E,+d|). (3) 
1=1 

The quantity displays the transference of /-nucleotide strings in the DNA 
sequence. To further determine positions and lengths of the transposable 
elements, we analyze the recurrent plot plane. Since Ej and E^ e E', the 
transposable element has the length / at least. Prom the recurrence plot 
plane, we calculate the maximal value of x to satisfy 

e(e; - |E,+, - E,+,|) = 1, X = 0, 1, 2, • • • (4) 

i.e., T^i+x a-nd T^j+x £ E'. Thus, the transposable element with the correction 
distance d = j — i + 1 has the length L = l + x. The transposable element is 
placed at the position {i — I + l,i + x) and {j — I + l,j + x). 



4 



3 Global transposable characteristics in the 
yeast complete DNA sequence 



The Saccharomyces cevevisiae yeast has 16 chromosomes, which are denoted 
as YEAST I to XVI. Using the metric representation and recurrence plot 
methods, we analyze correlation structures of the 16 DNA sequences. Ac- 
cording to the characteristics of the correlation structures, we summarize the 
results as follows: 

(1) The correlation distance has a short period increasing. The YEAST I, 
IX and XI have such characteristics. Let me take the YEAST I as an example 
to analyze. Fig.l displays the correlation intensity at different correlation 
distance d{< N — I) with / = 15. A local region is magnified in the figure. 
It is clearly evident that there exist some equidistance parallel lines with a 
basic correlation distance db — 135. Using Eq. (4), we determine positions 
and lengths of the transposable elements in Table I, where their lengths 
are limited in L > 100. Many nucleotide strings have correlation distance, 
which is the integral multiple of dh- They mainly distribute in two local 
regions of the DNA sequence (25715-26845) and (204518-206554) or (11.2- 
11.7%) and (88.8-89.7%) expressed as percentages. The YEAST IX and XI 
have similar behaviors. The YEAST IX has the basic correlation distance 
db = 18. Many nucleotide strings (L > 50) with the integral multiple of 
db mainly distribute in a local region of the DNA sequence (391337-393583) 
or (89.0-89.5%) expressed as percentages. The YEAST XI has the basic 
correlation distance db — 189. Many nucleotide strings (L > 50) with the 
integral multiple of db mainly distribute in a local region of the DNA sequence 
(647101-647783) or (97.1-97.2%) expressed as percentages. 

(2) The correlation distance has a long major value and a short period 
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increasing. The YEAST II, V, VII, VIII, X, XII, XIII, XIV, XV and XVI 
liavc such characteristics. Let me take the YEAST II as an example to ana- 
lyze. Fig. 2 displays the correlation intensity at different correlation distance 
d{< N — I) with I — 15. The maximal correlation intensity appears at the 
correlation distance — 38534. A local region is magnified in the figure. It 
is clearly evident that there exist some equidistance parallel lines with a ba- 
sic correlation distance = 36. In Table II, positions and lengths (L > 50) 
of the transposable elements are given. The maximal transposable elements 
mainly distribute in two local regions of the DNA sequence (221249-224565, 
259783-263097) or (27.2-27.6%, 31.9- 32.4%) expressed as percentage. Near 
the positions, there also exist some transposable elements with approximate 
values for df,. Moreover, many nucleotide strings have correlation distance, 
which is the integral multiple of db. They mainly distribute in a local region 
of the DNA sequence (391337-393583) or (89.0-89.5%) expressed as percent- 
ages. In the other 9 DNA sequences, the YEAST V, X, XII, XIII, XIV, 
XV and XVI have the same basic correlation distance db = 36 and similar 
behaviors with different major correlation distance dm =49099, 5584, 9137, 
12167, 5566, 447110 and 45988, respectively. The YEAST VII and VIII have 
different basic correlation distance db — 12 and 135, and similar behaviors 
with the major correlation distance dm — 255548 and 1998, respectively. 

(3) The correlation distance has a long quasi-period increasing. The 
YEAST III has such characteristics. Fig. 3 displays the coherence inten- 
sity at different correlation distance d{< N — I) with / = 15. The correlation 
intensity has the maximal value at the correlation distance dmi — 185903 
and two vice-maximal values at the correlation distance dm.2 — 93625 and 
dm3 = 279528. Since dm2 ~ dmif^ ~ d^s/S, the coherence distance has 
a quasi-period increasing. A local region is magnified in the figure. These 
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does not exist any clear short period increasing of the correlation distance. 
Using Eq. (4), we determine positions and lengths (L > 50) of the transpos- 
able elements in Table 111. The maximal and vice-maximal transposable ele- 
ments mainly distribute in local regions of the DNA sequence (11499-13810, 
197402-199713), (198171-199796, 291794-293316) and (12268-12932, 291794- 
292460) or (3.6-4.4%, 62.6-63.6%), (62.8-63.4%, 92.5-93.0%) and (3.9-4.1%, 
92.5-92.7%) expressed as percentage. 

(4) The correlation distance has a long major value and a long quasi- 
period and two short period increasing. The YEAST IV has such char- 
acteristics. Fig. 4 displays the coherence intensity at different correlation 
distance d{< N — I) with I — 15. The maximal coherence intensity ap- 
pears at the correlation distance = 3885. There also exist three vice- 
maximal values at the correlation distance dmi = 232800, dm2 = 109349 and 
dm3 — 341221, which forms a long quasi-period increasing of the correlation 
distance, i.e., dm2 ~ dmi/2 fa dms/^- A local region is magnified in the fig- 
ure. It is clearly evident that there exist two short period increasing with 
dbi = 84 and di)2 = 192 in the correlation distance. In Table IV, positions 
and lengths (L > 100) of the transposable elements are determined by us- 
ing Eq. (4). All correlation distance with the long major value and the 
long quasi-period and two short period increasing are denoted. The trans- 
posable elements with dm, dmi, dm2, dms, dbi and db2 mainly distribute in 
local regions of the DNA sequence (527570-538236), (871858-876927, 981207- 
986276), (645646-651457, 878346-884257), (646379-651032, 987600-992253), 
(1307733-1308591) and (758135-759495) or (34.4-35.1%), (56.9-57.2%, 64.0- 
64.4%), (42.1-42.5%, 57.3-57.7%), (42.2-42.5%, 64.4-64.8%), (85.36-85.41%) 
and (49.5-49.6%) expressed as percentages. 

(5) The DNA sequence is hardly relevant. The YEAST VI has such 
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characteristics. Fig. 5 displays the coherence intensity at different correlation 
distance d{< N — l) with / = 15. The maximal coherence intensity appears at 
the correlation distance dm = 5627. A local region is magnified in the figure. 
The sequence has not a short period increasing of the coherence distance. 
In Table V, positions and lengths (L > 50) of the transposable elements are 
given. Only one nucleotide string with the length 337 has the correlation 
distance dm. The YEAST VI is almost never relevant, so the YEAST VI 
approaches a random sequence. 

4 Conclusion 

Global transposable characteristics in the yeast complete DNA sequence is 
determined by using the metric representation and recurrence plot meth- 
ods. Positions and lengths of all transposable nucleotide strings in the 16 
chromosome DNA sequences of the yeast are determined. In the form of 
the correlation distance of nucleotide strings, the fundamental transposable 
characteristics displays a short period increasing, a long quasi-period increas- 
ing, a long major value and hardly relevant. The 16 chromosome sequences 
are divided into 5 groups, which have one or several of the 4 kinds of the 
fundamental transposable characteristics. 
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Table I. Transference of nucleotide strings with lengths L{> 100) for the 
YEAST I with 230209 bases. 



No. 


String 


r 


Position 1 


Position 2 


L 


d 


Note 


1 


t^a-- 


- act 


11745-11969 


24177-24401 


225 


12432 




2 


ctg-- 


-ah 


12258-12396 


24711-24849 


139 


12453 




3 


g^a-- 


-g^a 


12988-13171 


25153-25336 


184 


12165 




4 


(?g-- 


-cgt 


25715-25851 


26255-26391 


137 


540 


44 


5 


at^-- 


- ac^ 


25739-25851 


26414-26526 


113 


675 


5^6 


6 


gta-- 


- ac^ 


25751-25851 


26561-26661 


101 


810 


6^6 


7 


gta-- 


- a(? 


25751-25851 


26696-26796 


101 


945 


7^6 


8 


t'g-- 


■g't 


25853-25968 


26393-26508 


116 


540 


Mb 


9 


atg-- 


■gtg 


25925-26035 


26060-26170 


111 


135 


db 


10 


atg-- 


-agt 


25925-26058 


26195-26328 


134 


270 


2dh 


11 


agt-- 


■gtg 


26050-26170 


26185-26305 


121 


135 


db 


12 


at^-- 


■gac 


26279-26406 


26414-26541 


128 


135 


db 


13 


gta-- 


-gac 


26291-26406 


26561-26676 


116 


270 


2db 


14 


gta-- 


-gac 


26291-26406 


26696-26811 


116 


405 


3db 


15 


gta-- 


■gtg 


26426-26710 


26561-26845 


285 


135 


db 


16 


tga-- 


- aca 


160239-160575 


165827-166163 


337 


5588 




17 


cac ■ ■ 


- tac 


204518-204802 


204653-204937 


285 


135 


db 


18 




- tac 


204567-204667 


205512-205612 


101 


945 


7db 


19 




■ tac 


204702-204802 


205512-205612 


101 


810 


6db 


20 


gH-- 


-aH 


204837-204949 


205512-205624 


113 


675 


5db 


21 


a(P - - 


• c^a 


204855-204969 


205395-205509 


115 


540 


44 


22 


etc - - ■ 


cat 


205042-205168 


205312-205438 


127 


270 


2db 


23 


cac ■ ■ 


■ act 


205058-205178 


205193-205313 


121 


135 


db 


24 


cac ■ ■ 


■ cat 


205193-205303 


205328-205438 


111 


135 


db 


25 


atg-- 


-t\ 


205758-205879 


206433-206554 


122 


675 


5db 
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Table II. Transference of nucleotide strings with lengths L(> 50) for the 
YEAST II with 813142 bases. Due to the limited spacing, 22 nucleotide strings in 

the total number 57 are not presented. 



No. 


String 


r 


Position 1 


Position 2 


L 


d 


Note 


1 


tag ■ ■ 


• agt 


1584-1650 


1692-1758 


67 


108 


?>dh 


2 


taq ■ ■ 


■ act 


1620-1674 


2016-2070 


55 


396 


114 


3 


tag ■ ■ 


• act 


1620-1674 


2268-2322 


55 


648 


184 


4 


tqc ■ ■ 


■tg'' 


1815-1870 


1851-1906 


56 


36 


df, 


5 


act ■ ■ 


- gta 


1860-1936 


1932-2008 


77 


72 


2db 


6 


tqc ■ ■ 


- gta 


1887-1936 


2355-2404 


50 


468 


134 


7 


toe • ' 


• qtq 


1959-2018 


2355-2414 


60 


396 


114 


8 


oca • • 


• aot 


2012-2082 


2264-2334 


71 


252 


7db 


9 


tg^-- 


• agt 


2022-2190 


2094-2262 


169 


72 


24 


10 


cao • • 


■ ata 


2037-2104 


2325-2392 


68 


288 


84 


11 


tg^-- 


- aqt 


2094-2154 


2274-2334 


61 


180 


54 


12 


cao • • 


■ ata 


2109-2176 


2325-2392 


68 


216 


64 


13 


tg^-- 


- agt 


2166-2226 


2274-2334 


61 


108 


34 


14 


cao • • 


■ qta 


2181-2248 


2325-2392 


68 


144 


44 


33 


a^t-- 


■cH 


221182-221231 


259718-259767 


50 


38536 




35 


gta ■ ■ 


■tct 


221249-221308 


259783-259842 


60 


38534 




37 


gat-- 


- tea 


222669-222827 


261203-261361 


159 


38534 


dm 


38 


ta^-- 


■tgc 


222829-223096 


261363-261630 


268 


38534 


dm 


39 


cat ■ ■ 


■get 


223098-223609 


261632-262143 


512 


38534 


dm 


40 


(]?C - - 


-g^a 


223611-223966 


262145-262500 


356 


38534 


dm 


41 


te2 ■ ■ • 


(?g 


223968-224563 


262502-263097 


596 


38534 


dm 


42 


ga^-- 


■ tac 


224736-225308 


263273-263845 


573 


38537 


~ dm 


43 


ata - - 


■ aca 


225310-225493 


263847-264030 


184 


38537 


~ dm 


44 


a(P - - 


■ a^c 


225841-226231 


264378-264768 


391 


38537 


~ dm 


45 


acg-- 


-t^ 


226233-226797 


264770-265334 


565 


38537 


~ dm 
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Table III. Transference of nucleotide strings with lengths k{> 50) for the 
YEAST III with 315341 bases. 



No. 


Strin] 


3 


Position 1 


Position 2 


L 


d 


Note 


1 




2 


90-141 


233-284 


52 


143 




2 


a^t-- 


2 

• ca 


1190-1253 


4083-4146 


64 


2893 




3 


t'g-- 


-gta 


11499-11764 


197402-197667 


266 


185903 


dml 


4 


tag-- 


- eta 


11766-11908 


197669-197811 


143 


185903 


dml 


5 


ta^-- 


-gae 


11910-12185 


197813-198088 


276 


185903 


dml 


6 


at^-- 


■gtg 


12187-13810 


198090-199713 


1624 


185903 


dml 


7 


tac - - 


- tat 


12268-12340 


291794-291866 


73 


279526 


~ dm3 


8 


ata ■ ■ 


-at^ 


12325-12932 


291853-292460 


608 


279528 


dm3 


9 


aH-- 


■gtg 


13691-13810 


293114-293233 


120 


279423 




10 


cg"^ ■ ■ 


■ ea^ 


13812-14006 


199715-199909 


195 


185903 


dml 


11 


cg"^ - - 


-t\ 


13812-13893 


293235-293316 


82 


279423 




12 


tgt-- 


- a^e 


83677-83802 


84419-84544 


126 


742 




13 


tgt-- 


- c?e 


83677-83802 


90049-90174 


126 


6372 




14 


cta - - 


- (j?e 


83724-83802 


83951-84029 


79 


227 




15 


cc? ■ ■ 


■tg^ 


83804-83902 


84031-84129 


99 


227 




16 


cc? ■ ■ 


■tg' 


83804-83902 


84546-84644 


99 


742 




17 


cc? • • 


■tq^ 


83804-83902 


90176-90274 


99 


6372 




18 


eta ■ ■ 


■tet 


83951-84230 


84466-84745 


280 


515 




19 


eta - - 


- tat 


83951-84189 


90096-90334 


239 


6145 




20 


tgt-- 


- tat 


84419-84704 


90049-90334 


286 


5630 




21 


eac - - 


-t^ 


123942-124058 


142661-142777 


117 


18719 




22 


tac - - 


- tat 


198171-198243 


291794-291866 


73 


93623 


~ dm2 


23 


ata ■ ■ 


-at^ 


198228-198835 


291853-292460 


608 


93625 


dm2 


24 


aH-- 


-t\ 


199594-199796 


293114-293316 


203 


93520 


~ dm2 


25 


g^a-- 


■g' 


267365-267574 


267692-267901 


210 


327 
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Table IV. Transference of nucleotide strings with lengths L{> 100) for the 
YEAST IV with 1531977 bases. Due to the limited spacing, 112 nucleotide strings 
without any notes in the total number 176 are not presented. 



No. 


Strin 


g 


Position 1 


Position 2 


L 


d 


Note 


28 


act ■ ■ 


-a^ 


527570-527835 


531455-531720 


266 


3885 




29 


act • • 


- a^t 


527570-527781 


535340-535551 


212 


7770 




30 


gt"-- 


- (?a 


527891-528066 


531776-531951 


176 


3885 


dm 


31 


gt^-- 


- (?a 


527891-528066 


535661-535836 


176 


7770 




32 


aa^ • 


--aH 


528094-531666 


531979-535551 


3573 


3885 




35 




■t^g 


531668-532364 


535553-536249 


697 


3885 




36 


gca ■ 


- - (?a 


532366-534351 


536251-538236 


1986 


3885 




40 


ata ■ ' 


■ - agt 


645546-646118 


878346-878918 


573 


232800 


dm2 


41 


aq^ ■ 


- - aca 


645635-645873 


992440-992678 


239 


346805 




42 


age • 


- - ate 


646120-646336 


878920-879136 


217 


232800 


dfji2 


43 


ct"-- 


- gta 


646338-646726 


879138-879526 


389 


232800 




44 




atg 


646379-646564 


987600-987785 


186 


341221 




45 




- cat 


646787-647179 


879587-879979 


393 


232800 


dm2 


46 


tat ■ ■ 


-t^c 


646964-647288 


988185-988509 


325 


341221 


dm3 


47 




ta^ 


647310-647470 


880110-880270 


161 


232800 


dm2 


48 




- tga 


647468-647693 


988689-988914 


226 


341221 




49 


gt'-- 


-cgt 


647472-648042 


880272-880842 


571 


232800 


dm2 


50 


gag- 


- - aga 


647695-649519 


988916-990740 


1825 


341221 


dm3 


51 


gat-- 


-cg^ 


648165-648265 


880965-881065 


101 


232800 


dm2 


52 


tc^-- 


-atg 


648486-649715 


881286-882515 


1230 


232800 


dm2 


53 


cgt-- 


-atg 


649521-649715 


990742-990936 


195 


341221 


dm3 


54 


ctg-- 


-t^ 


649853-650096 


991074-991317 


244 


341221 


dm3 


55 


tag-' 


■ - etc 


649946-650073 


882746-882873 


128 


232800 


dm2 


56 


gat-- 


- aca 


650075-651457 


882875-884257 


1383 


232800 


dm2 


57 


ctg-- 


-ct^ 


650098-651032 


991319-992253 


935 


341221 


dm3 
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60 


ag^ ■ 


■ ■ aca 


651219-651457 


992440-992678 


239 


341221 


^m<3 


63 


a(? ■ 


■ ■ (?a 


757478-757581 


757670-757773 


104 


192 


db2 


68 


eta • 


■ ■ act 


758135-758259 


758519-758643 


125 


384 


2db2 


69 


eta ■ 


■ ■ act 


758135-758259 


758711-758835 


125 


576 


3dh2 


70 


eta ■ 


■ ■ act 


758135-758259 


759479-759603 


125 


1344 


7db2 


71 


gca ■ 


2 

■ ■ q a 


758219-758343 


758411-758535 


125 


192 


db2 


72 


qca ■ 


■■aH 


758219-758347 


759179-759307 


129 


960 


5db2 


73 


qca ■ 


■ ■ g^a 


758219-758343 


759371-759495 


125 


1152 


6db2 


74 


ctq ■ 


■ ■ act 


758349-758451 


758925-759027 


103 


576 


3db2 


75 


ctg ■ 


■ ■ g^CL 


758349-758535 


759117-759303 


187 


768 


^db2 


76 


ctq ■ 


■ ■ cat 


758349-758683 


759309-759643 


335 


960 


5db2 


77 


eta ■ 


■ ■ cat 


758495-758683 


758687-758875 


189 


192 


db2 


78 


eta ■ 


■ ■ aga 


758687-758901 


759455-759669 


215 


768 


4^62 


79 


qca ■ 


■ ■ act 


758795-759027 


758987-759219 


233 


192 


db2 


80 


eta ■ 


■ ■ act 


758903-759027 


759287-759411 


125 


384 


2db2 


81 


gca ■ 


■ ■ aga 


758987-759093 


759563-759669 


107 


576 


Sdb2 


82 


eta ■ 


■ ■ q'^a 


759095-759303 


759287-759495 


209 


192 


db2 


99 


tgt- 




871858-872030 


981207-981379 


173 


109349 




103 


tqc ■ 


• • ca^ 


872202-872307 


981551-981656 


106 


109349 


dml 


104 


a^q ■ 


■ ■ cag 


872309-872592 


981658-981941 


284 


109349 


dml 


105 


at^ ■ 


■ ■ tat 


872737-872871 


982086-982220 


135 


109349 


dml 


106 


a(? ■ 


■ ■ CO? 


873022-873286 


982371-982635 


265 


109349 


dml 


107 


tga- 


■ ■ cat 


873378-874087 


982727-983436 


710 


109349 


dml 


108 


tea ■ 


■■gH 


874089-874236 


983438-983585 


148 


109349 


dml 


109 


tac ■ 


■■tgc 


874238-874593 


983587-983942 


356 


109349 


dml 


110 


aga ■ 


■ ■ ate 


874595-874764 


983944-984113 


170 


109349 


dml 


111 




■ cc? 


874853-875247 


984202-984596 


395 


109349 


dml 


112 


gat- 


■ ■ c?e 


875249-875604 


984598-984953 


356 


109349 


dml 
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113 


2 

ca • 


■ ■ tga 


875637-876474 


984986-985823 


838 


109349 




114 


age ■ 


■ ■ tga 


876491-876927 


985840-986276 


437 


109349 


dull 


117 


q^a ■ 


■ ■ aga 


877085-877385 


986434-986734 


301 


109349 


dml 


118 


qa^ ■ 




877387-877657 


986736-987006 


271 


109349 


dml 


169 


cat ■ ■ 


■ a(? 


1307733-1307835 


1308321-1308423 


103 


588 


7dbi 


170 


ac? ■ 


■g^c 


1307749-1307874 


1308505-1308630 


126 


756 


9dbi 


171 




■ ate 


1307753-1307871 


1307921-1308039 


119 


168 


2dbi 


172 


at'-- 


■ ate 


1307921-1308039 


1308509-1308627 


119 


588 


7dbi 


173 




■ a,tc 


1308089-1308249 


1308341-1308501 


161 


252 


3dbi 


174 


acP ■ ■ 


■ ■ ate 


1308169-1308333 


1308253-1308417 


165 


84 


dbi 


175 


ac^ ■ ■ 


■ ■ ae^ 


1308337-1308507 


1308421-1308591 


171 


84 


dbi 
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Table V. Transference of nucleotide strings with lengths L(> 50) for 

YE AST VI with 270148 bases. 

No. String Position 1 Position 2 L d Note 

1 tat---aca 137905-138238 143532-143865 334 5627 dm 

2 ga'^---t^ 178016-178086 178157-178227 71 141 

3 ca^-'-gtc 178088-178157 178229-178298 70 141 

4 tgt---gtg 210332-210391 210334-210393 60 2 
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Figure caption 

Fig. 1. A plot of correlation intensity versus correlation distance d 
for the YEAST 1. 

Fig. 2. A plot of correlation intensity H((i) versus correlation distance d 
for the YEAST II. 

Fig. 3. A plot of correlation intensity S(d) versus correlation distance d 
for the YEAST III. 

Fig. 4. A plot of correlation intensity versus correlation distance d 
for the YEAST IV. 

Fig. 5. A plot of correlation intensity E{d) versus correlation distance d 
for the YEAST VI. 
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